Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🚀 Tokenizer Performance
Lexical Analysis, Unicode Handling, SIMD Optimization, Streaming
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
184862
posts in
10.0
ms
VihangaFTW/bytetok
: A fast BPE tokenizer with a clean Python API.
github.com
·
1d
·
Discuss:
r/Python
🔤
Language Tokenizers
Task-Centric
Acceleration
of Small-Language Models
arxiv.org
·
22h
⚡
Tokenizer Optimization
TurboSparse
Efficiency: Achieving 97% Parameter Sparsity in
Mixtral-47B
hackernoon.com
·
21m
🌱
Minimal ML
Beyond
Pandas
:
Architecting
High-Performance Python Pipelines
hackernoon.com
·
6h
📋
JSON Parsing
UQLM
: A Python Package for Uncertainty
Quantification
in Large Language Models
jmlr.org
·
14h
📊
LR Parsing
Optimizing
Recommendation Systems with
JDK
’s Vector API
netflixtechblog.com
·
1h
🔀
SIMD Programming
Token
Optimization:
Compressing
Context for Cheaper Agents
sitepoint.com
·
7h
🌊
Streaming Lexers
Qwen 3.5 9B, 4B models beating
30B
,
80B
models
huggingface.co
·
7h
·
Discuss:
Hacker News
🏁
Language Benchmarks
Google AI Introduces STATIC: A Sparse Matrix Framework
Delivering
948x Faster
Constrained
Decoding for LLM Based Generative Retrieval
marktechpost.com
·
1d
⚡
Tokenizer Optimization
Byte-Pair
Encoding
en.wikipedia.org
·
1d
·
Discuss:
Hacker News
📦
Compression Algorithms
Build a serverless conversational AI agent using Claude with LangGraph and managed
MLflow
on Amazon
SageMaker
AI
aws.amazon.com
·
8h
🔍
Tokenizers
Show HN:
Timber
–
Ollama
for classical ML models, 336x faster than Python
news.ycombinator.com
·
20h
·
Discuss:
Hacker News
🌱
Minimal ML
Structured
Outputs
for LLMs
ternarysearch.blogspot.com
·
20h
·
Discuss:
Hacker News
,
ternarysearch.blogspot.com
🪜
Recursive Descent
The
185-Microsecond
Type
Hint
blog.sturdystatistics.com
·
5h
·
Discuss:
Lobsters
,
Hacker News
,
r/programming
⚡
Interpreter Optimization
Architecting
and
Evaluating
an AI-First Search API
research.perplexity.ai
·
5h
🔍
Search Algorithms
OpenAI Codex-Spark Achieves Ultra-Fast Coding
Speeds
on
Cerebras
Hardware
infoq.com
·
12h
🗺️
Region Inference
Chunk-wise
Attention
Transducers
for Fast and Accurate Streaming Speech-to-Text
arxiv.org
·
22h
🔄
Incremental Tokenizers
I Built a Side Project That Works in 4
Languages
dev.to
·
5h
·
Discuss:
DEV
🔤
Language Tokenizers
`
derive
_
parser
` – Automatically
derive
a
parser
from your syntax tree
github.com
·
17h
·
Discuss:
r/rust
🦀
Rust Macros
The Broken Token:
Tokenization
for
Malayalam
Language Models
thottingal.in
·
4d
·
Discuss:
Hacker News
⚡
Tokenizer Benchmarks
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help